K | # of bigrams | # of trigrams | # of 4-grams | # of 5-grams | # of 6-grams |
---|---|---|---|---|---|
100 | 62 | 88 | 99 | 99 | 99 |
1000 | 148 | 465 | 700 | 851 | 924 |
10000 | 534 | 1429 | 3309 | 5659 | 7444 |
100000 | 1997 | 10607 | 23750 | 40809 | 58743 |
1000000 | 3950 | 30985 | 96986 | 183558 | 265190 |
Both the problem and the results are much similar to the previous subsection: We consider letter-N-grams at the end of words instead of the beginning.
3.8.1 Number of letter-N-grams at word beginnings